Qifan Wang

Meta AI

Bringing Reasoning to Generative Recommendation Through the Lens of Cascaded Ranking

Feb 03, 2026
Via arXiv

Adversarial Reward Auditing for Active Detection and Mitigation of Reward Hacking

Feb 02, 2026
Via arXiv

TokenSeek: Memory Efficient Fine Tuning via Instance-Aware Token Ditching

Jan 27, 2026
Via arXiv

On-the-Fly VLA Adaptation via Test-Time Reinforcement Learning

Jan 13, 2026
Via arXiv

How Do Large Language Models Learn Concepts During Continual Pre-Training?

Jan 07, 2026
Via arXiv

Demystifying Hybrid Thinking: Can LLMs Truly Switch Between Think and No-Think?

Oct 14, 2025
Via arXiv

TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making

Sep 10, 2025
Via arXiv

Pisces: An Auto-regressive Foundation Model for Image Understanding and Generation

Jun 12, 2025
Via arXiv

FinHEAR: Human Expertise and Adaptive Risk-Aware Temporal Reasoning for Financial Decision-Making

Jun 10, 2025
Via arXiv

Guideline Forest: Experience-Induced Multi-Guideline Reasoning with Stepwise Aggregation

Jun 09, 2025
Via arXiv